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Abstract 

Exploiting the information provided by the molecular 
noise of a biological process has proven to be valuable 
in extracting knowledge about the underlying kinetic pa- 
rameters and sources of variability from single cell mea- 
surements. However, quantifying this additional informa- 
tion a priori, to decide whether a single cell experiment 
might be beneficial, is currently only possibly in very sim- 
ple systems where either the chemical master equation is 
computationally tractable or a Gaussian approximation 
is appropriate. Here we show how the information pro- 
vided by distribution measurements can be approximated 
from the first four moments of the underlying process. 
The derived formulas are generally valid for any stochas- 
tic kinetic model including models that comprise both 
intrinsic and extrinsic noise. This allows us to propose 
an optimal experimental design framework for heteroge- 
neous cell populations which we employ to compare the 
utility of dual reporter and perturbation experiments for 
separating extrinsic and intrinsic noise in a simple model 
of gene expression. Subsequently, we compare the infor- 
mation content of different experiments which have been 
performed in an engineered light-switch gene expression 
system in yeast and show that well chosen gene induction 
patterns may allow one to identify features of the system 
which remain hidden in unplanned experiments. 



Introduction 

Quantitative studies of biological systems with mathe- 
matical models strongly depend on an appropriate char- 
acterization of the underlying system, that is on good 
knowledge about the underlying mechanisms and kinetic 
parameters. While extracting such knowledge from aver- 
aged cell population data is common practice, it has only 
recently been realized that also the molecular noise ob- 
served in single cell measurements may be a rich source 
of information about the parameters of stochastic kinetic 
models [TJ [2J [3J H] . Mathematically, the information pro- 
vided by single cell measurements can be quantified with 
the Fisher information. To compute the Fisher informa- 
tion for stochastic kinetic models one has to solve the 
chemical master equation which governs the time evolu- 
tion of the probability distribution of the underlying pro- 
cess [S]- This is, however, only possible in the simplest 
cases and even approximation techniques either remain 
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limited to very small systems [5] or are based on strong 
assumptions \E\ which are rarely fulfilled in real appli- 
cations. Consequently, experiments are usually designed 
based on the intuition of the experimenter, rather than 
on information theoretic criteria. 

A second difficulty in the analysis and design of sin- 
gle cell experiments is that stochastic kinetic models are 
usually based on the assumption that the same process 
governs the evolution in all cells of the population. How- 
ever, in reality cells are different, due to different cell size, 
local growth conditions, expression capacity or several 
other factors [21 [TU]. This so-called extrinsic variabil- 
ity [TTl [T21 ITS] makes the process evolve slightly differ- 
ent in each cell, often leading to additional variability in 
the response of the cell population. Indeed, in many in- 
stances noise resulting from such extrinsic variability has 
been reported to dominate molecular noise [TTl [151 ITS] . 
In such situations methods that assume a homogeneous 
cell population and attribute all the observed variability 
to molecular noise may lead to biased results. Extrinsic 
variability does not necessarily refer to "extrinsic to the 
cell" but extrinsic to the model developed for a specific 
process. Some of this variability may be static for the 
time scales of interest. For example, the number of mi- 
tochondria affecting translation [3] changes much more 
slowly than species in signaling and transcription cas- 
cades which form the heart of the model. In this case 
the number of mitochondria can be taken as random but 
constant in time for the purposes of the model. Some of 
the extrinsic variability, however, may be due to species 
which are not included in the model but affect reaction 
rates and evolve on time scales comparable to those of 
the reactions of interest [TTl El HE] , f° r instance global 
regulators affecting transcription of genes of interest. In 
theory one should include such species in the model but 
this is often not practical since it would lead to models 
of intractable size. A convenient modeling abstraction 
may then be to include an assumed stochastic process for 
some of the rates, to serve as a rudimentary abstraction 
of the complex mechanisms governing the fluctuations of 
the reaction rates [10] . 

In this paper we develop a framework for optimally de- 
signing single cell distribution experiments which is ap- 
plicable in the presence of possibly time- varying extrinsic 
variability and scalable to systems of realistic size. To 
this end we first demonstrate on systems where one re- 
action rate is governed by a stochastic differential equa- 
tion how equations describing the time evolution of the 
moments of the probability distribution can be derived 



for systems which are influenced by time-varying extrin- 
sic variability. We then show how the Fisher information 
can be approximated from the first four moments with- 
out the need of any assumption other than a sufficiently 
large measured cell population. Finally, we embed the 
approximated Fisher information into an optimization al- 
gorithm which returns the most informative experiment 
for a specified set of model parameters. This allows us 
to design optimal experiments for identifying specific fea- 
tures of the system. We demonstrate this by comparing 
dual reporter and perturbation experiments in a simple 
model of gene expression where the mRNA production 
rate is varying according to a stochastic differential equa- 
tion. Finally, we study the variability in an engineered 
light-switch gene expression system in yeast. We use our 
methodology to evaluate the experiments that were per- 
formed in |19) and show that they strongly differ in the in- 
formation provided about the unknown parameters. Fur- 
ther, we show that an experiment found by employing our 
optimal experimental design procedure would lead to far 
more information than any of the experiments reported 
in [TO]. 

Methods 

Moment equations for reaction systems in fluctu- 
ating random environments The time evolution of 
the probability distribution of stochastic kinetic models 
is governed by the chemical master equation (CME). If 
extrinsic variability is present in a population, the distri- 
bution for each cell can be described by a CME which is 
conditioned on the realizations of the extrinsic variables 
in each cell. In [3] it was shown how population moments 
can be computed from this conditional CME under the 
assumption that the extrinsic variables are constant in 
time. 

More generally, assume now that extrinsic variability 
can be modeled by a reaction parameter at governed by 
a stochastic differential equation of the form 



dat = r(fi a — at)dt + Sy/atdWt. 



(1) 



where Wt is a standard Brownian motion. This process 
fluctuates around its mean ^t a , where the mean reversion 
speed r gives the autocorrelation time of the process and 
thereby determines the time scale of the rate fluctuation. 
The noise coefficient s determines the size of the devi- 
ations from the mean, whereas the term yjal prevents 
the process from taking negative values and is in accor- 
dance with the frequently used Langevin approximation 
for chemical reaction networks [5]. Note that this for- 
mulation includes constant extrinsic variability since for 
r = s = at is constant and distributed according to its 
initial distribution. 

The system which jointly describes the time evolution 
of the species and the extrinsic variable is a stochastic 
hybrid system |20] , The time evolution of the moments 
can be computed as 



-E[ip{at,x(t))] 



E[(L^)(ot,x(t))] 



(2) 



where x(t) = [xi(t) ■ ■ ■ x m (t)] is a vector containing the 
molecule counts of the m species, ip : K x R m — > R is 



chosen such that the left hand side gives the derivative of 
the desired moment and L is the extended generator of 
the stochastic hybrid system, given by 



(Lip)(a,x) 



l d 2 jj(a,x) 2 
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da 

K 

+ y^ (4>{a,x + Vi) -%l)(a,x))ai(a,x), 



where f(a) = r(fi a — a), i/j are the stoichiometric transi- 
tion vectors and ai(a,x) the propensities of the K reac- 
tions. Note that the resulting system of moment equa- 
tions may be non-closed in the sense that the time evo- 
lution of the moments of any order depends on moments 
of higher order. In such cases the moments cannot be 
computed exactly and approximation techniques have to 

be used [211122122. 

Approximating the Fisher information The 

amount of information about model structure or pa- 
rameters which can be gained from measurements may 
be highly dependent on the experimental setup that 
is chosen [21 1251 [23 123 123- Carefully planning an 
experiment reduces experimental effort and resources and 
may even allow one to answer questions which cannot be 
answered from unplanned experiments. 

Predicting the information about a vector of unknown 
model parameters 6 — [9\ ■ ■ ■ 6n] that an experimental 
setup can supply requires computation of the Fisher in- 
formation matrix 1(6), whose elements are given by 



(i(e))u = e 
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log f(Y; l 



where Y is the random variable which is experimentally 
measured and f(Y;8) is its distribution. In stochastic 
kinetic models the parameter vector 9 typically contains 
reaction rates and, if extrinsic variability is present, ex- 
trinsic parameters such as moments of the extrinsic dis- 
tribution [3] or parameters describing the fluctuations of 
the extrinsic variables. Population measurements, such 
as those provided by flow cytometry, can be viewed as 
a large number of independent samples Y\ , . . . , Y n which 
are drawn from a marginal distribution f(Y; 0) of the un- 
derlying process. The computation of f(Y; 9), and there- 
fore also the computation of the Fisher information ma- 
trix, requires the solution of the CME. As this is usually 
not available, methods which can be used to approximate 
the Fisher information matrix using Monte Carlo simula- 
tions have been proposed [291 130| . Such approaches are, 
however, computationally expensive and hardly feasible 
in most applications. 

An alternative approach resorts to a Gaussian assump- 
tion on the underlying Markov process [311 152] . Under 
this assumption, the sample mean and variance of the 
measured population form a jointly sufficient statistic. 
Hence, computation of the Fisher information reduces to 
solving the differential equations that describe the dy- 
namics of mean and variance of f(Y;9) and computing 
their partial derivatives with respect to 9. There are, 
however, many systems where a Gaussian assumption is 
not reasonable [3J [33] • In such cases the method in [5T] 
may lead to erroneous results for several reasons. First, 



the computed means and variances of f(Y; 9) may be in- 
accurate. Second, information that can be gained from 
higher order moments is neglected. And third, the Gaus- 
sian approximation implicitly assumes that sample mean 
and variance provide independent pieces of information, 
an assumption which is violated for all non-Gaussian dis- 
tributions [34]. For instance, for a Poisson distribution 
the sample mean is already a sufficient statistic on its 
own and the variance adds no new information (see Sup- 
porting Information Section S.1.2). 

In situations where a Gaussian assumption is not ap- 
plicable it may still be possible to approximate the infor- 
mation which is provided by sample mean and variance. 
If the sample size is sufficiently large, the central limit 
theorem implies that sample mean and variance are ap- 
proximately jointly Gaussian. For simplicity, assume that 
there is only one unknown parameter 9 and that only one 
species is measured (a more general case is treated in 
the Supporting Information Section S.1.3). The informa- 
tion given by the mean I m (9) and the joint information 
given by mean and variance Ij{9) can then be approxi- 
mated using the special form of the Fisher information for 
multivariate Gaussian random variables (see Supporting 
Information Sections S.I.I and S.1.3), which results in 
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(3) 



, (4) 



where n is the size of the sample, ^i denotes the mean 
and iik, k = 2, ..., 4 the central moments up to order 4. 

These formulas are valid for any distribution which sat- 
isfies the requirements of the central limit theorem. Fur- 
ther, it can be shown [35] that I m (9) and Ij(9) provide 
lower bounds on the information of the whole sample. For 
a Gaussian distribution, since /13 = and 114 — 3/4, Ij{9) 
reduces to the correct expression for the complete infor- 
mation. For a Poisson distribution, since /Ui = (X2 = H3, 
I j (9) reduces to I m (9), which gives the correct expression 
for the complete information. 



Designing optimal experiments The goal of experi- 
mental design is to find the experiment which is optimal 
according to some criterion reflecting information about 
the unknown parameters. The most frequently used cri- 
teria are D-optimality, A-optimality and _B-optimality 
which correspond to maximizing the determinant, min- 
imizing the trace of the inverse and maximizing the min- 
imal eigenvalue of the Fisher information matrix, respec- 
tively. In biological applications, especially in models 
which include extrinsic and intrinsic variability, it may 
be desirable to design experiments which are targeted to 
specific parameters or to subsets of the parameter set. 
For instance one might want to estimate the kinetic pa- 
rameters of the model as well as possible, despite the 
presence of extrinsic noise. Or conversely, one may be 
more interested in understanding the extrinsic variability 
only. An optimality criterion targeted to such questions 
is D s -optimality |36l I37| . It is based on partitioning the 



parameter vector 9 = \9\ #2] T in parameters of interest 
9\ and nuisance parameters 9 2- The experiment, which 
allows one to obtain the confidence region with minimum 
volume for 9\ can then be found by maximizing the de- 
terminant of 

I s (9) = / n (0) - h 2 (9)h2(9)- 1 l2i(9), where 

hi(9) Iu(9) 
hi{9) h2(9) 



1(9) 



(5) 



I\\(9) and I 2 2(9) are the information matrices for 9\ and 
#2, respectively and 1 12(6) and l2i(9) give the cross terms 
between 9\ and #2- 

The computation of I s (9) requires knowledge of the 
true parameters 9, which are not available. This diffi- 
culty can, for instance, be overcome by evaluating the 
information at some initial estimate 9. If, however, this 
initial estimate differs significantly from the true param- 
eter vector, the resulting experiment may be far from op- 
timal. This is especially important for biological appli- 
cations where initial estimates, if available at all, usually 
involve large uncertainties. Here we chose an approach 
which includes the uncertainty of the initial estimate in 
the form of a prior distribution n(9) and computes the 
expected information with respect to ir(9) [381155] (for an 
overview of other methods see Supporting Information 
Section S.4). The corresponding optimal experiment e* 
can then be obtained by solving the following optimiza- 
tion problem: 



argmaxjEg [det I s (9, e)]}, 



(6) 



where the expectation is taken over 9 ~ 7r(0), Is(9,e) is 
the information matrix for experiment e evaluated at 9 
and £ is the space of possible experiments. We can now 
state a procedure for designing optimal experiments for 
the estimation of parameters of stochastic kinetic mod- 
els from measurements of a heterogeneous cell population: 



The optimal experimental design procedure 

• Include extrinsic variability by taking reaction 
rates as stochastic processes as in Eq.pQ. 

• Derive the differential equations for the required 
moments using Eq.[5]. If the moment equations 
are non-closed and cannot be solved exactly use 
an approximation method [21, 22J. 

• Solve the differential equations to compute the 
moments and their partial derivatives with re- 
spect to the parameter vector as functions of 9 
and t. 

• Choose a vector 9\ of parameters of interest and 
specify the distribution ir(9) according to prior 
uncertainty about 9. 

• Solve the optimization problem Eq.[B], where the 
total information / s (9, e) is replaced by the ap- 
proximated information of sample mean and vari- 
ance according to Eq.[4]. 



Some comments on practical applicability of this proce- 
dure are given in the SI Appendix Section S.1.4. 

This optimal experimental design procedure can be per- 
formed in iterations with experiments. Starting from 
some prior distribution tt(9), the computations lead to 
optimal experiments that yield data which can be used 
in a parameter inference scheme [3] to compute posterior 
distributions. These can then in turn serve as new prior 
distributions for the computation of a new optimal exper- 
iment. This can be continued until the uncertainty about 
the parameters has been sufficiently reduced. 

Results 

In silico study of a gene expression system We 

demonstrate the proposed experimental design framework 
on a simple example of gene expression. The model we 
consider consists of the two species mRNA (M) and pro- 
tein (P) and the four reactions 



a t ■ g(t) 



M 



M 



M 



■+ M + P 



where b — 0.03, c = 0.5, d — 0.04 and at is varying accord- 
ing to a stationary stochastic process of the form Eq. pQ . 
The dynamics of the moments of order up to two of M 
and P are then completely determined by the mean rever- 
sion speed r and mean /i a and variance V a of the station- 
ary distribution of at (see Supporting Information Section 
S.2.1). Here we assume that the values of these param- 
eters are \i a = 0.5, V a = 0.1 and r = 0.005. We further 
assume that the gene can be switched between an on state 
(where g(t) = 1) and an off state (where g(t) = 0) using 
some external input, for example by adding different nu- 
trients in nutrient shift experiments [40], by adding salt 
to induce the osmotic stress response [UJ, or with light 
pulses |19] . Further, throughout this section we assume 
that it is known that no molecules are present at time 
t = (loosely speaking the gene has been off for some 
time at the start of the experiment) and that the degra- 
dation rates b and d are known, whereas /i Q , V a , r and 
c have to be determined from the measurements. Fur- 
ther, for simplicity, all the computations of this section 
are performed locally at the "true" parameter values and 
prior uncertainty about the parameters is not included. 

In the following we compare four experimental methods 
in terms of the information they can provide about the 
unknown parameters. For all methods we assume that 
the experiments are limited to a time length of t = 300 
time units and that at most ten measurements of the pro- 
tein distribution are taken during that time. The first 
two methods we consider are standard time course exper- 
iments, where the gene is switched on only at time zero 
and measurements are taken in regular time intervals (ev- 
ery 30 time units). In the first method (referred to as 
unplanned experiments) we assume that a single reporter 
protein is measured, whereas in the second method (un- 
planned dual reporter experiments) an identical copy of 



the gene is added to the cells, such that a second reporter 
protein which is conditionally independent of P, given 
the history of at, can be measured [HI HUES]- These two 
methods are compared to more sophisticated experiments 
where informative gene switching patterns and measure- 
ment times were identified using our experimental design 
framework. Thereby, the searches for the most informa- 
tive experiments were performed using a Markov chain 
Monte Carlo-like algorithm, where we first searched for 
the optimal gene switching pattern for equally spaced 
measurement times and then sequentially placed the mea- 
surement times where they yielded maximal information 
for the previously found gene switching pattern. More 
details on this algorithm are given in the Supporting In- 
formation Section S.2.3. Again, we consider both single 
and dual reporter experiments (referred to as optimal per- 
turbation and optimal dual reporter experiments, respec- 
tively) . Figure 1 gives the resulting optimal perturbation 
experiments targeted at the parameters r and V a (results 
for the remaining parameters and results for the optimal 
dual reporter experiments are given in Figures S.2 and S.3 
in the Supporting Information). It can be seen that dif- 
ferent perturbations and measurement times are optimal 
for identifying different parameters. 
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Figure 1: Optimal perturbation experiments for the param- 
eters r (A and B) and V a (C and D). (A,C) Gene switching 
patterns found by employing the experimental design procedure. 
A value of one corresponds to the gene being switched on, a value 
of zero to the gene being switched off. (B,D) Measurement times 
which attain the highest information for the gene switching pat- 
terns of A and C, respectively. Information is normalized by the 
sample size n. In both panels the first two measurement times 
were placed at t = 150 and t = 300 (blue circles in the bot- 
tommost curve). The bottommost curve shows the information 
which would be obtained by placing a third measurement at dif- 
ferent time points between t=0 and t=300. The measurement is 
then placed at the time point where the information is maximal 
(indicated by the red circle on the bottommost curve) and the 
procedure is repeated for the next measurement. From bottom 
to top measurements are being placed one at a time (red circle 
on the corresponding curve) and the total information increases. 
The blue circles show the location of the measurements that were 
already placed. 

The results of the comparison of the four methods are 
summarized in Table 1. From the first column we see that 
the information which can be gained from unplanned ex- 
periments is very small. This indicates that the param- 



Table 1: Comparison of different experimental approaches. Information normalized by the sample size n. Rows: Information for 
different parameters of interest. Columns: Information which can be gained by different experimental approaches. Computations 
corresponding to unplanned experiments were performed with the gene being switched on only once at time zero and equally spaced 
measurement times. Optimal experiments include optimal gene switching patterns and sequentially placed measurement times (see 
Supporting Information Section S.2.3). 

Unplanned experiment Unplanned dual reporter Optimal perturbations Optimal dual reporter 



Ma 


0.0037 


V a 


0.0185 


C 


0.0037 


r 


48.5647 



10.3106 
18.6882 
11.3211 
271.5223 



4.4499 
12.8646 

4.7267 
515.6095 



10.8805 

36.9019 

12.4333 

975.4253 



eters may be practically unidentifiable. Unplanned dual 
reporter experiments, on the other hand, lead to much 
more information (second column in Table 1) and appear 
to be suitable for identifying both the extrinsic and the in- 
trinsic parameters of the system. The information of the 
optimal perturbation experiments is given in the third 
column of Table 1. It can be seen that, compared to 
dual reporter experiments, more information is obtained 
for the parameter r, whereas dual reporter experiments 
lead to more information for fi a , V a and c. Hence, de- 
pending on the objective of the study, different experi- 
mental strategies are preferable. The fourth experimental 
method, which combines optimal perturbations and dual 
reporters, naturally leads to the most information for all 
parameters (fourth column in Table 1). Note, however, 
that the increase in information compared to unplanned 
dual reporter experiments is very small if fi a or c are the 
parameters of interest, which indicates that additionally 
perturbing the system may not be worth the effort. 

Finally, we also computed the information for the first 
two methods under a Gaussian assumption. Our results 
(Supporting Information Table S.l) show that for many 
objectives a Gaussian assumption leads to information es- 
timates which are overly optimistic. This is most likely 
due to the independence assumption of sample mean and 
variance which is implicitly imposed by a Gaussian ap- 
proximation. 

Characterizing extrinsic noise in a light-induced 
gene expression system Next we study a light-switch 
gene expression system which has been engineered in 
yeast [19) . The authors used a light responsive mod- 
ule to initiate transcription by shining red light on the 
yeast culture and to terminate transcription by shining 
far-red light. They then proposed a control scheme to 
regulate the mean amount of protein in the population. 
The development of more sophisticated control schemes 
(for example, to allow one to also control the protein vari- 
ance) requires a proper characterization of the sources 
of variability in the system. To this end our framework 
can serve to compare the utility of different experiments 
and ultimately to design the experiments which are opti- 
mal for the characterization of the different noise sources. 
We thus developed a stochastic version of the model in 
|19| with time varying extrinsic variability. This intro- 
duced four additional parameters which were not iden- 
tified in |19j . To characterize uncertainty about these 
parameters we chose independent uniform prior distribu- 
tions and computed the expectation of the information 
which is provided by the experiments reported in Figure 



1 of [in] about each of the additional parameters accord- 
ing to Eq.[5]. Thereby, the remaining parameters which 
were already identified in |19| were fixed to their known 
values (see Supporting Information Section S.3.1 and Sec- 
tion S.3.2). The results are shown in Tables S.2 and S.3 
in the Supporting Information. Which experiment is best 
again depends on the objective. For instance, the exper- 
iment where a red light pulse is applied at the beginning 
and a far-red light pulse after 30 minutes is best to iden- 
tify the protein production rate but worst for all other 
parameters. This is most likely due to the fact that if 
the gene is switched off, the mRNA production rate is 
set to zero and the parameters describing this rate do not 
influence the dynamics anymore. 

Further, our results show that even though the exper- 
iments in Figure lc of |19) were performed over a longer 
time and contain more measurements than the experi- 
ments in Figure le of |19j . the experiments in Figure le 
provide much more information about the mean reversion 
speed r of the extrinsic fluctuations. This suggests that 
experiments where the gene is expressed for short time in- 
tervals with silent periods in between could allow one to 
determine r and thus to test whether extrinsic variability 
can be assumed to be constant or whether a stochastic 
process description as used in this paper is required. Our 
results in Section S.3. 3 in the Supporting Information in- 
dicate that indeed, contrary to other experiments, the 
variance measured in the experiments in Figure le in |19| 
cannot be explained very well by a model with constant 
extrinsic variability. 

Finally, we also employed our experimental design 
framework to search for experiments which carry high in- 
formation about r. The light-pulse pattern and measure- 
ment times we determined with our method are shown 
in Figure 2. They lead to an experiment which carries 
close to four times more information than any of the ex- 
periments in |19| suggesting that our experimental design 
framework can be a valuable tool for characterizing vari- 
ability in this system. 



Discussion 

While knowledge about biological mechanisms is con- 
stantly growing, our understanding of the stochasticity of 
biological systems and its influence on system dynamics 
remains rather limited. Many different sources of vari- 
ability may play a role and neglecting any one of them 
may lead to substantial errors. For instance, neglecting 
extrinsic variability in the gene expression model and us- 
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Figure 2: Results of experimental design targeted at the pa- 
rameter r for the light-switch gene expression system. The 

optimal light-pulse pattern is shown at the bottom of the figure. 
Red rectangles correspond to red light pulses, black rectangles 
correspond to far-red light pulses. The upper part of the figure 
shows the optimal measurement times, where the interpretation 
is the same as in Figure 1. 



ing the ideas in |44| to quantify protein burst size and 
frequency from measurements of the protein distribution 
would lead to conclusions which are more than an order 
of magnitude wrong (see Supporting Information Section 
S.2.2). Allowing reaction rates to vary between individ- 
uals in a cell population, either by taking them as un- 
known constant random variables or by allowing them to 
be governed by stochastic differential equations, offers a 
way to incorporate extrinsic variability in a model and 
enables model-based studies of heterogeneous cell popu- 
lations. We showed how the information about unknown 
parameters of such models which is provided by means 
and variances of measured populations can be approxi- 
mated by solving a set of ordinary differential equations. 
The derived formulas apply to any kind of system with the 
only assumption that the measured population is of suffi- 
cient size for the application of the central limit theorem. 
This opens up the possibility to pose many interesting 
questions: do the measurements contain enough informa- 
tion to separate the different noise sources? How much 
information can be gained by measuring the variance in 
addition to the mean? And most importantly: what is the 
most informative experiment? By means of examples we 
demonstrated that unplanned experiments may not con- 
tain enough information to separate the different sources 
of variability and that designing experiments based on in- 
tuition alone may not be sufficient. For instance, placing 
all the measurements either very early or very late in the 
experiment turns out to be optimal for identifying the 
mean reversion speed in our examples (see Figures 1 and 
2), but appears very unintuitive at a first glance. 

Our results (Table 1 and Supporting Information Fig- 
ure S.2) show that the optimal experiments are highly 
dependent on the chosen objective. In some cases intro- 
ducing a dual reporter yields high information, in other 
cases perturbing the system with input stimuli is prefer- 
able. A study of the system using the experimental design 
framework presented in this paper allows a comparison of 
the different experimental approaches and enables one to 
choose the approach which is most likely to be successful 



for the given objective. The resulting experiments can in 
turn be used to refine the model and to update the pa- 
rameter estimates, giving rise to an iterative procedure of 
successive rounds of computations and experiments. 

In the light-switch gene expression system the compu- 
tation of the information contents of the different exper- 
iments shows that perturbing the system with different 
light pulse sequences can highlight different features of 
the system. This suggests that well chosen gene induction 
patterns may allow one to uncover features of the system 
which remain hidden in unplanned experiments. For in- 
stance Figure S.4 in the Supporting Information suggests 
that temporal fluctuations in the extrinsic variable may 
play a role for this system. Perturbing a system with 
light pulses to understand the variability may seem to 
be limited to this specific engineered system. However, 
a similar strategy could also be employed by exploiting 
naturally occurring biological mechanisms. For example, 
in [3] the authors studied gene expression in yeast in re- 
sponse to osmotic pressure. Different salt concentrations 
lead to different residence times of the signaling molecule 
Hogl in the nucleus and thereby created different input 
signals to the downstream gene expression system. In 
that system multiple subsequent salt stresses, which can 
for instance be implemented using a micro-fluidic device 
as in |41) . could serve as the equivalent of the multiple 
light-pulses used in this paper and might give further in- 
sights into the specific nature of the system. 
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